Link-space formalism for network analysis 
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We introduce the link-space formalism for analyzing network models with degree-degree corre- 
lations. The formalism is based on a statistical description of the fraction of links hj connecting 
nodes of degrees i and j. To demonstrate its use, we apply the framework to some pedagogical net- 
work models, namely, random-attachment, Barabasi- Albert preferential attachment and the classical 
Erdos and Renyi random graph. For these three models the link-space matrix can be solved analyt- 
ically. We apply the formalism to a simple one-parameter growing network model whose numerical 
solution exemplifies the effect of degree-degree correlations for the resulting degree distribution. 
We also employ the formalism to derive the degree distributions of two very simple network decay 
models, more specifically, that of random link deletion and random node deletion. The formalism 
allows detailed analysis of the correlations within networks and we also employ it to derive the form 
of a perfectly non-assortative network for arbitrary degree distribution. 
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Networks - in particular large networks with many 
nodes and links - are attracting widespread attention. 
The classic reviews 0, H, Q with their primary focus on 
structural properties have been followed up by more re- 
cent ones addressing the role of dynamics, such as spread- 
ing and synchronization processes on networks, as well as 
the role of weights and mesoscopic structures, i.e. cliques 
and communities, within networks 0,0. Although sev- 
eral different measures for characterizing networks have 
been presented, for example in a recent survey [||, the 
simple concept of vertex degree remains unrivalled in 
its ability to capture fundamental network properties. 
When comparing the degrees of connected vertices, how- 
ever, one often finds that they are correlated, a quality 
that gives rise to a rich set of phenomena. Degree corre- 
lations constitute a central role in network characteriza- 
tion and modelling but, in addition to being important 
in their own right, also have substantial consequences for 
dynamical processes unfolding on networks. Given the 
increasing current interest in network dynamics, under- 
standing structural correlations remains important and 
timely. 

In this paper we provide a detailed mathematical for- 
malism for modelling degree-degree correlations within 
stochastically evolving, non-equilibrium networks. It is 
built around a statistical description of inter-node link- 
ages as opposed to single-node degrees. While correla- 
tions have been characterized in empirical and model net- 
works, most works devoted to analytical calculations of 
correlations in models, as pointed out in Q, have been 
performed only for particular cases. We start by provid- 
ing a brief overview of degree correlations for network 
structure and dynamics in Section |T] and discuss how the 
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work presented relates to existing studies. The link-space 
formalism, which lies at the core of this paper, is intro- 
duced in Section [TTJ The formalism comprises a master 
equation description of the evolution of a specific matrix 
construction, termed the link-space matrix, which charac- 
terizes degree-degree correlations within a network. The 
formalism can be implemented in a variety of ways. To 
demonstrate its use, we apply it to two well-known, non- 
equilibrium examples, namely random-attachment and 
Barabasi-Albert (BA) preferential-attachment networks 
[1,111 in Section HH and solve the so-called link-space ma- 
trix analytically for the steady state of these systems. 
In Section |TT1 we also apply it to the classical equilib- 
rium random graph of Erdos and Renyi [Io| (ER) which, 
interestingly, requires a full, time-dependent solution of 
the link-space master equations. The cumulative link- 
space introduced in Section [II] aids comparison between 
simulated and analytically-derived link-space matrices 
and could, in principle, be applied to empirical networks 
to better ascertain their degree correlations. Analytic 
derivation of link-space matrices allows detailed analysis 
of the degree-degree correlations present within networks 
as is demonstrated in Section IIIIl Within Section IIII1 
we also show how, for an arbitrary degree distribution, 
a link-space matrix can be derived which has no corre- 
lations present, i.e. the link-space is representative of a 
perfectly non-assortative network with that degree distri- 
bution. We then consider the counter-intuitive prospect 
of finding steady states of decaying networks in Section 
ITVl In Section El we introduce a simple one-parameter 
network growth algorithm, which is able to produce net- 
works whose degree distributions exhibit a wide range of 
power-law exponents via a redirection process similar to 
the model by Krapivsky and Redner ,11]. The model is 
interesting in its own right in the sense that it makes 
use of only local information about node degrees. Here 



2 



the link-space formalism allows us to identify the transi- 
tion point at which higher exponent degree distributions 
switch to lower exponent distributions with respect to 
the BA model. We conclude in Section PVTl To maintain 
readability, we postpone the more detailed mathematical 
derivations to Appendices IA1 IBl and [C | and refer to them 
where appropriate. 



I. OVERVIEW OF DEGREE CORRELATIONS 

A discussion of correlations might naturally start with 
the phenomenon of clustering. The highly influential pa- 
per of Barabasi ans Albert and many subsequent pa- 
pers focussed on the modelling and analysis of models 
which replicate power-law degree distributions observed 
in many real- world systems [H, d, IHj]- However, the 
BA preferential attachment mechanism (discussed in Sec- 
tion [nB| lacks certain key features observed in many of 
those systems, one such feature being clustering. Clus- 
tering is an important local statistic in many networks 
and reflects the connectedness of neighbours of a node. 
In a social context, if person Y is friends with both X 
and Z , one might expect some kind of link between X 
and Z . The measurement of the connectedness of neigh- 
bours of nodes is the clustering coefficient. This is often 
averaged over all nodes and a single value is given as the 
average clustering coefficient of the network [2j. Some 
node labelled r\ in an arbitrary undirected network has 
k v neighbours, between which there could be a possible 
k v (k v — l)/2 links. If y of these are actually present, the 
clustering of node 77 is given as C = 2y/ (fc^(fc^ — 1)). The 
clustering nature of a network can also be expressed as 
the average over all nodes of degree k giving a cluster- 
ing distribution (or spectrum), C'k- As noted by Klemm 
and Egufluz, in a BA network, the average clustering of 
a given node is independent of its degr ee II 31] . in con- 
trast to the findings of Fronczak et al. [l4j j . and tends 
to zero in the large-size (thermodynamic) limit. A net- 
work model which does exhibit high clustering is that of 
Watts and Strogatz in which a random rewiring_process 
is carried out on an initially regular lattice [15j . The 
networks generated by such a process feature short aver- 
age shortest path lengths between node pairs (the small- 
world effect). However, these networks do not exhibit a 
power-law degree distribution. Subsequently, there have 
been many efforts to build models which can encompass 
both of these features such as the non-equlibrium growing 
network model proposed by Klemm and Egufluz which 
features deactivation of a nodes availibility to be con- 
nected to . Other models such as that of Holme and 
Kim [l6[ and that introduced by Toivonen et al. [ItJ 
modified the original BA preferential attachment mech- 
anism, allowing further links between the new node and 
neighbours of the preferentially selected node. The social 
network model of Toivonen et al. produced communities 
with dense internal connections. Szabo et al. formulated 
a scaling assumption and a mean-field theory of cluster- 



ing in growing scale-free networks and applied it to the 
Holme and Kim mechanism [l8| . As discussed by Boguha 
and Pastor-Satorras, clustering in networks is closely re- 
lated to degree correlations [l9j. Infact, based on the 
work of Szabo et al., Barrat and Pastor-Satorras intro- 
duced a framework for computing the rate equation for 
two vertex correlations in the continuous degree and con- 
tinuous time approximation [7j. We shall now describe 
these degree correlations. 

Vertex degree correlations are measures of the statisti- 
cal dependence of the degrees of neighbouring vertices in 
a network. In general, n-vertex degree correlations, or n 
point correlations, can be fully characterized by the con- 
ditional probability distribution P(ki, ki, . . . ,k n \k = n) 
that a vertex of degree k — n is connected to a set of 
n vertices with degrees ki, fa, . . . , k n . Two and three- 
point correlations are of particular interest in complex 
networks as they can be related to network assortativity 
and clustering, respectively. More specifically, two ver- 
tex degree correlations (two point correlations) can be ex- 
pressed as conditional probability P(k'\k) that a vertex of 
degree k is connected to a vertex of degree k' . Similarly, 
three vertex degree correlations (three point correlations) 
can be fully characterized by the conditional probability 
distribution P(k' , k"\k) that a vertex of degree k is con- 
nected to both a vertex of degree k 1 and a vertex of degree 
k" . This implies that the degrees of neighbouring nodes 
are not statistically independent. Reliable estimation of 
P(k'\k) and P(k',k"\k) requires a large amount of data 
and, in practice, one often resorts to related measures. 
Instead of P(k'\k), the average degree of nearest neigh- 
bours, (k nn )k, of nodes with degree k is often measured. 
This can be formally related to P{k'\k) [H[20|. If (k nn ) k 
increases with k, high degree vertices tend to connect to 
high degree vertices. A network with this property is de- 
scribed as assortative or displaying positive degree corre- 
lations. If (k nn )k decreases with k, high degree vertices 
tend to connect to low de gree vertices (disassortative or 
negatively correlated) [19. |21|. An alternative to (k nn )k 
is to use a normalised Pearson's correlation coefficient of 
adjacent vertex degrees providing a single number mea- 
sure of assortativity as suggested by Newman to further 
classify networks [!, [2l| . These approaches are discussed 
in more detail in Section IIII1 To characterize three point 
correlations, instead of using P(k', k"\k), one can employ 
the clustering spectrum Ck, the average clustering coef- 
ficient of nodes with degree k, which can be related to 
P(k' , k"\k) 11911. In many real- world networks such as 
the Internet [201 ] , the clustering spectrum is a decreasing 
function of degree and while this is sometimes interpreted 
as a signature of hierarchical structure in a network, Sof- 
fer and Vazquez suggested that this is a consequence of 
degree-degree (two point) correlations that enter the def- 
inition of the standard clustering coefficient [221 ] . The 
authors introduced a different definition for the cluster- 
ing coefficient that does not have the degree-correlation 
'bias', i.e. a three point correlation measure that filters 
out two point correlations. Following the suggestion of 
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Maslov et al. [23j that these phenomena might arise from 
topological constraints rather than evolutionary mecha- 
nisms, Park and Newman demonstrated that dissasor- 
tative degree correlations observed in the Internet could 
be explained via the restriction of there being no dou- 
ble edges between nodes [3]. In contr ast, social net- 
works have been found to be assortative [17| . Similarly, 
Catazaro et al. observed that the network of scientific 
collaborations was assortative and presented a model to 
reproduce this feature (25j . 

Functional processes occurring on networks are influ- 
enced by degree correlations, highlighting the impor- 
tance of their role in complex networks. Egufluz and 
Klemm considered highly clustered scale-free networks 
and showed that correlations play an important role in 
epidemic spreading [26]. The time average of the frac- 
tion of infected individuals in the steady-state undergoes 
a phase transition at a finite critical infection probability. 
They related this critical threshold to the transmission 
probability and the mean degree of nearest neighbours 
of all nodes in the system, (k nn ), the conjectured crite- 
rion for epidemic spreading being related to the product 
of the two. The value (k nn ) scales with the system size 
in their highly clustered scale-free network more slowly 
than in the random scale-free which is a byproduct of 
the dissasortativity of the system. Consequently, whereas 
in random scale-free networks in which viruses with ex- 
tremely low spreading probabilities can prevail, the ab- 
sence of connections between highly connected nodes in 
highly clustered scale-free networks protects the system 
against epidemics [26|. Interestingly, Boguna et al. as- 
sert that any scale-free network of appropriate exponent 
will have diverging (k nn ) in the thermodynamic limit, re- 
sulting in no threshold properties for epidemic spreading 
regardless of the correlations within the network [27] in 
contrast to the earlier suggestion of Boguna and Pastor- 
Satorras [Hj]. Brede and Sinha induced correlations to 
Erdos-Renyi and scale-free networks by rewiring them 
appropriately to examine their dynamic stability. They 
mapped the adjacency matrices into Jacobian matrices 
and examined the largest eigenvectors, reflecting the de- 
cay rates of perturbations about an equilibrium state. 
They found that positive correlations reduced their dy- 
namical stability [291 ] . Similarly, Bernardo et al. induced 
negative degree correlations in scale-free networks whose 
links couple non-linear oscillators. Through analysis of 
the eigenratio of the Laplacians of such networks, they 
found that network synchronisability improved as the 
network was made more dissassortative |30|. Based on 
this result, they conjectured that negative degree correla- 
tions may emerge spontaneously as the networked system 
attempts to become more stable 3l|. The same authors 
found similar results to hold also for weighted networks 
(30| . Maslov and Sneppen found that in "interaction and 
regulatory networks, links between highly connected pro- 
teins were systematically suppressed, whereas those be- 
tween highly connected and low-connected pairs of pro- 
teins were favoured" , a topological organisation that in- 



creases the overall robustness of the network to pertur- 
bations through "localising deletous perturbations" [32l |. 
This is consistent with the findings of Berg et al. whose 
model of the evolution of protein interaction networks 
exhibited disassortativity consistent with their empirical 
findings [33| . 

Krapivsky et al. used a master equation method, 
in which rate equations for the densities of nodes of 
a given degree are employed, to investigate the st eady 
state of the B A preferential attachment mechanism [3J] . 
As illustrated in Section [Til this is a general method 
which can be applied to various growing network mod- 
els. To extend the method to encompass two-point cor- 
relations, Krapivsky and Redner applied the approach 
to the number N^i of nodes with total degree k con- 
nected to ancestor nodes of total degree I in a directed 
network They solved analytically the master equa- 
tions for the steady-state of a specific directed network 
growth model, namely the Growing with Redirection al- 
gorithm [Tl| . similar to the mixture model introduced 
in Section [Vj Boguna and Pastor-Satorras considered a 
link orientated description of two point correlations. For 
undirected graphs, they introduced a symmetric matrix 
whose elements E^^ represent the number of links con- 
necting nodes degree k to nodes degree I jl9| . Boguna 
and Pastor-Satorras related this matrix to the joint de- 
gree distribution P(k' \ k) but noted that finite size ef- 
fects make empirical evaluation of the matrix difficult, 
suggesting the use of the spectrum (k nn )k as a more 
suitable observable. Here, we introduce a similar ma- 
trix construction which can be applied to both the undi- 
rected and directed scenarios. We call this the link-space 
matrix. We show that it is possible to construct master 
equations to model the evolution of this matrix which can 
be applied to a wide variety of network evolution algo- 
rithms retaining important degree correlations which can 
be critical to the network's development. This framework 
is termed the link-space formalism and is introduced in 
Section [TTJ In certain cases, these master equations can 
be solved analytically providing a full time-dependent so- 
lution or a steady-state solution of the link-space matrix 
of the network. From these analytically derived link- 
space matrices both the degree and joint degree distribu- 
tions can be obtained allowing accurate analysis of degree 
correlations. The formalism also allows the derivation of 
the form of networks of predetermined correlation prop- 
erties such as a perfectly non-assortative network (Sec- 
tion 1111 A[) and the derivation of the steady-state solu- 
tions to network decay algorithms (Section UVj. The for- 
malism can also be employed in its iterative guise to pro- 
vide approximations to the steady-state when an analytic 
solution is not possible such as for the mixture model of 
Section [VI 

One and two-point correlations have natural physical 
counterparts within the network, specifically the nodes 
and links. The one-point correlations P(k) are simply re- 
lated to the fraction of nodes in the network with degree 
k. As described in detail within this paper, two-point 
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correlations P(k' \ k) are related to the fraction of links 
within a network connecting nodes of degree k with nodes 
of degree k', hence the term link-space. One can extend 
the analysis to three-point correlations, P(k',k" | k). 
This would be related to the number (fraction) of pairs of 
links within a network sharing a common node of degree 
k, the remaining link ends being connected to nodes of 
degrees k' and k" and all possible 'open triangles' within 
the network would have to be considered. Clearly, the 
process can be extended to arbitrary n point correlations 
although the physical interpretation of the appropriate 
measurable quantities will become increasingly obscure. 



II. NODE-SPACE AND LINK-SPACE 

We now introduce the link-space formalism. Consider 
a simple, growing, non-equilibrium network in which one 
node is added to a network at each timestep and this node 
is connected to the existing network with exactly m undi- 
rected links. The process is governed by an attachment 
probability kernel Qj, defined as the probability that a 
specific, newly-introduced link attaches to any node of 
degree j within the existing network. At some time t 
there exist Xj(t) nodes of degree i and we wish to com- 
pute the expected number of nodes with degree i at time 
t+1. The node- space master equations can be expressed 
in terms of the attachment kernels and are written 

(Xi(t + 1)) = Xi(t) + m@i-x(t) - m6i(f) i > m, 
(X m (t+1)) = X m (t) + l-m@ m (t), (1) 

since a new node of degree m is added to the existing 
network at each timestep and there are no nodes with 
degree less than m. 

So far we have said nothing about the attachment 
mechanism, and have made the easily geneneralizable re- 
striction that only one node is being added per timestep 
with undirected links. We now follow a similar analysis, 
but retain the node-node linkage correlations that are in- 
herent in many real- world systems [IlL HJ, [H[ . Consider 
any link in a general network - we can describe it by 
the degrees of the two nodes that it connects. Hence we 
can construct a matrix L(f) such that the element Lij(t) 
is equal to the number of links from nodes of degree i 
to nodes of degree j for i ^ j at some time t. To ease 
the mathematical analysis below, the diagonal element 
Li.i{t) is defined to be twice the number of links between 
nodes of degree i for the undirected graph, a mathemat- 
ical convienience also observed by Boguha and Pastor- 
Satorras [l9j . For undirected networks L(£) is symmetric 
and J2i j Li.j (t) = 2M(t), twice the total number of links 
M(t) in the network which is simply mt when introduc- 
ing m links per timestep. The matrix, L, represents a 
surface describing degree-degree correlations in the net- 
work (see Figs. [1] [2] and [3]) and is called the link-space 
matrix. 

Consider one of the newly introduced links, one end 
of which is attached to the new node. The probability 



of selecting any node of degree i — 1 within the existing 
network for the other end to attach to is given by the 
attachment probability 0j_i(i). Suppose an i — 1 node 
is selected. The fraction of nodes of degree i — 1 that are 
connected to nodes of degree j is 

Li-i,j(t) 



(z-l)AViW 

The expected increase in links from nodes of degree i to 
nodes of degree j, through the attachment of the new 
node to a node of degree i — 1, is given by 

ej-i(t)£i-i,j(t) 
*i-i(t) 

Since each link has two ends, the value L^j can increase 
by a connection to an [i — l)-degree node which is in 
turn connected to a j-degree node, or by connection to 
an (j — l)-degree node which is in turn connected to an 
«-degree node. We write master equations governing the 
evolution of the link-space matrix as the evolution of the 
expected number of links from i to j degree nodes, i.e. the 
number of i «-> j links. This is the link-space formalism 
and is written for the (generalizable) case of adding one 
new node with m undirected links to the existing network 
as 



(L id (t + 1)) = Lij(t) + 



mQi-i(t)Li-i t j(t) 



X l -i{t) 

mQj-i {t)Lij-i (t) mOi (t)L it j (t) 



me 3 -(t)£ i|J -(t) 
X 3 {t) 



Xi(t) 



i,j > m, 



(L mJ (t + l)) = L TOii (t)+m0 i _i(*) + 
mQ m (t)L m .j(t) 



m<dj-i{t)L m .j_i{t) 



x m (t) 

m®j(t)L m j(t) 



j > m. 



(2) 



There are a variety of ways in which both the 
node-space and link-space master equations can be ap- 
proached. For example, a full, time-dependent solution 
could be investigated as in Section ["II CI or. using appro- 
priate initial conditions, the equations can be iterated 
over the required timescale as in Section [Vj We can also 
investigate the possibility of a steady state of the algo- 
rithm under scrutiny. To do so, we assume that there 
exists a steady state in which the degree distribution re- 
mains static and investigate a solution (4?J . Under this 
assumption, the fraction of nodes Ci(t) — Xi(t)/N(t) 
which have a given degree remains constant such that 
(Xi(t + 1)) — Xi(t) w dXi/dt — Ci when one new node is 
added per timestep (N(i) =t). It is also assumed that in 
the steady state, the attachment kernels are static too. 
We drop the notation '(£)' to indicate the steady-state 
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and can rewrite Eq. |T|) as 



d = mOi_i — m8j, i > m, 

C m 1 TO@ m . 



(3) 



The fraction of links between nodes of degree i and nodes 
of degree j can be expressed as the normalized link-space 
matrix, h,j(t) — Lij(t)/M(i), which sums to 2 in the 
undirected case. In the steady state we can assume that 
these values are static and can rewrite the link-space mas- 
ter equation (Eq. ©) as 



l, J > TO, 



_ 



-/ 

c,-_i tm J- 



+ 



e 3 



j > TO. 



(4) 



The notation '(£)' has again been dropped to indicate 
the steady-state and here the generalizable situation of 
adding one new node per timestep is considered. 

To apply the link-space formalism one starts by spec- 
ifying the model dependent attachment kernel O;. To 
investigate a time-dependent solution, the attachment 
kernel is substituted into Eqs. (Q} and ([2]). To inves- 
tigate a steady-state solution, the kernel is substituted 
into the Eqs. ([3|) and ((4]) to yield recurrence relations for 
Cj and li j respectively which can be solved analytically 
in some cases. The number of i-degree nodes is given by 
Xi(t) = i^ 1 J2k Li.k(t) which allows us to retrieve the de- 
gree distribution from the normalized link-space matrix: 



Cj(t) 



M(t)sr=iM*) 

iN(t) 



(5) 



Degree distributions of empirically observed or simu- 
lated networks typically become dominated by noise at 
large degrees reflecting the small probabilities associated 
with these values occurring. In the link-space, the situa- 
tion is exacerbated and the high i,j limit reflects connec- 
tions between these high degree nodes. This is, of course, 
rarer than the existence of nodes of either degree. Follow- 
ing the conventional approach which is applied to degree 
distributions , we can use a cumulative representation 
of the link-space to address this issue. The use of a cu- 
mulative binning technique averages over stochasticity in 
the system. With degree distributions, regression tech- 
niques (curve fitting) applied to the cumulative distribu- 
tion is used to obtain a more accurate description of the 
actual degree distribution than would be obtained from 
fitting to the empirical distribution itself [2j . The process 
can be similarly applied in the link-space. Surface fitting 
could be applied to the cumulative link-space obtained 
from an empirical network. From the cumulative fitted 
surface, a more accurate representation of the actual link- 
space could be obtained for the empirical network, from 
which a better representation of the network's correla- 
tions could be obtained. This process also allows for 



comparison between simulated and analytically-derived 
link-space matrices. We define the cumulative link-space 



matrix, 



cum l x,y 



h i to be 



cum b i,j — 7 7 l x,y- 



( 6 ) 

x=i y=j 

Note that we have not lost generality in that given a 
cumulative linkspace matrix, the actual link-space can 
be derived using variations on the following: 



H,j — cumH,j cum H-\-l y j cum li,j-\-X ~l~c 



i+X,j+l 



(7) 

The computation involved when evaluating C umh,j can be 
cut down considerably by first evaluating C umh,i, which 
in the undirected graph is equal to 2. The leftmost col- 
umn (or top row) for i > 1 can then be evaluated as 



ik.i — 



x=l 



(8) 



Note that the second term on the right hand side is a 
row sum of the normalized link-space matrix and, hence, 
quickly evaluated. Indeed this row sum can be related to 
the degree distribution q_i from Eq. ([5]). Subsequent el- 
ements can be evaluated using the following simple iden- 
tity, 



j ^ i. 



i^i—l.j ~t~c 

\irrt\i 



+ h—i,j—i- (9) 

We will now demonstrate the use of the link-space for- 
malism to study a random attachment model and the 
Barabasi- Albert (BA) model using steady-state solutions 
and the classical Erdos and Renyi random graph using a 
time-dependent solution. 



A. Random attachment model (steady-state 
solution). 

Consider first a random-attachment model in which at 
each timestep a new node is added to the network and 
connected to an existing node with uniform probability 
without any preference ('Model A' in 0, [§]) with one 
undirected link such that to = 1 and M(t) w N(t) = t. 
We assume a steady-state solution and the attachment 
kernel is 6^ = Xi/t = Cj. Substituting into Eq. ([3|), we 
obtain the recurrence relation q+i = Cj/2, which yields 
the familiar degree distribution Cj = 2 - '. Substituting 
into Eq. ([4]) yields the recurrence relation 



with In = 0. 



= (k-i,j + Zi,j-i)/3) i,j > 1) 
= (cj-i + h,j-i)/3, j > 1, 
The exact solution for l^j is 



U 



x=1 



j-1 

E 

k=l 



j-x 



x)2(x-l) 



3(*+j- 

x=2 



x) *2(x — 1) 



k o j — h \ — 1 



(3 fc 2 J 



i>i, 



(10) 



,3 > 1. 



(11) 
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where (^) is the conventional combinatorial 'choose' func- 
tion. Further mathematical details are given in Appendix 
lAl We shall make use of this solution to investigate the 
correlations of such a network in Section IIIII This nor- 
malized link-space matrix is illustrated in Fig. [T] along 
with a comparison to simulated networks using the cu- 
mulative link-space matrix. 



B. Barabasi- Albert (BA) model (steady-state 
solution). 

In the B A model [1, , at each timestep a new node is 
added to the network and connected to m existing nodes 
with probabilities proportional to the degrees of those 
nodes, i.e. Qi cx i yielding 



9<(i) 



iXjjt) = iXjjt) _ iXj(t) 
EJXjit) ~ 2M(t) ~ 2mt ■ 



(12) 



We consider the scenario when the new node is added 
with one undirected link, m = 1, and the attachment 
kernel is well approximated by 0^ « icj/2. Substi- 
tuting this into Eq. ([3|) yields the recurrence relation 



_ (i-l)cj-l 



jj^Ci-i, whose solution is Cj 



i(i+i)(»+2) • Using the same substitution, the link-space 
master equations yield the recurrence relations: 



+ C?-l)/j, 3 --i 
2 + i+j 



i,3 > 1) 



3 + j J y ' 

The exact solution for lij is obtained by algebraic manip- 
ulation of the normalised link-space matrix using the pre- 
viously derived degree distribution (further mathemati- 
cal details are given in Appendix [B| and is given by 



4Q-l)!(i-l)! 
(j+i + 2)) 



|G(i + 1) + 2G(i) - 3G(i - 1) 
+ 2 5> - + 6 ) ^ -x)-G(i-x-l)}], 

x=2 

(14) 



where G{x) 




for x > 
for x < 0. 

The first few rows of this matrix have the form: 
20 + 6)0'-!) 



hd j(j + l)(i + 2)(j + 3)' 

2j(i-l)Ci + 10) + 48 



3j(i + l)(i + 2)(j + 3)(i + 4)' 
3j 4 + 42 j 3 - 3j 2 + 246 j + 360 



(15) 



3J 9j(j + 1)0' + 2)0 + 3)0 + 4)0 + 5) ' 

We shall use this solution to investigate the correlations 
with this network in Section IIIII This normalized link- 
space matrix is illustrated in Fig. [5] along with a compar- 
ison to simulated networks. 



Erdos and Renyi random graph 
(time-dependent solution). 



The Erdos and Renyi (ER) classical random graph 
is the quintessential equilibrium network model [2j, [l0| . 
However, to employ the link-space formalism, which 
tracks the evolution of a network's correlation proper- 
ties, we must model it as an evolving, non-equilibrium 
network. To do so is a straightforward process and the 
model procedes as follows. At each timestep, we add 
one new node to the existing network. All possible links 
between the new node and all existing nodes are consid- 
ered and each is established with probability a. That is, 
a biased coin toss (Bernoulli trial) is employed for ev- 
ery node within the existing network to decide whether 
a link is formed between it and the new node. Con- 
seqently the expected number of new, undirected links 
with which the new node connects to the existing net- 
work is (m(t)) = aN(t — 1) w at and this would be 
described as an accelerating network [36j. No new links 
are formed between existing nodes. This model subse- 
quently produces a network in which the probability of a 
link existing between any pair of nodes is simply a and 
is thus representative of the ER random graph [Io| . A 
network grown to t nodes will have mean degree at which 
will also be the expected degree of all nodes in the net- 
work. Whilst the random attachment and preferential 
attachment models both have steady-state assymptotic 
behaviour, clearly this model doesn't and a full, time- 
dependent solution is required. 

Using the random attachment probability kernel 
®i(t) = Xi(t)/N(t), the node-space master equation for 
the number of nodes of degree i can be written 



(Xi(t + 1)) = Xi(t) + 
+P{m(t) 



<m(t))X 4 _i(t) (m(t))Xi{t) 



N(t) 
i}. 



N(t) 



(16) 



The second term on the right hand side reflects the ex- 
pected number of connections to i — 1 degree nodes mak- 
ing them nodes of degree i. The last term is the proba- 
bility that the new node itself is a node of degree i. This 
will be binomially distributed and, if we assume that t 
is large, we can make a Poisson approximation. Recall- 
ing that the fraction of nodes of degree i is given by 
d(t) = Xi(t)/N(t) = Xi(t)/t, we can rewrite Eq. (16]) as 

d (Mt)) , , . . e- at {at)' 1 . . 

— = acj_i(£) — aci{t) H . (17) 

at i\ 

The time-dependent solution of Eq. (|T7|) yields the degree 
distribution for this model and is simply 



(18) 



which is what we would expect for the random graph 
with mean degree of at in the large size limit (37j . We 
can similarly write the link-space master equation for this 
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FIG. 1: (Color Online) Left: the analytically-derived, normalized link-space matrix for the random attachment growth 
algorithm. This could be filled to arbitrary size but is truncated to maximum degree of 40 here. Right: comparison of 
the cumulative link-space matrices for the analytic solution and a simulation of the algorithm. The simulation comprises an 
ensemble average of 100 networks grown to 10 s nodes. The maximum degree obtained was 37. The first 30 rows (i values) of 
the cumulative link-space are illustrated and finite-size effects are noticable at high j. 




FIG. 2: (Color Online) Left: the analytically-derived, normalized link-space matrix for the preferential attachment growth 
algorithm. This could be filled to arbitrary size but is here truncated to maximum degree of approximately 10 s . Right: 
comparison of the cumulative link-space matrices for the analytic solution and a simulation of the algorithm. The simulation 
comprises an ensemble average of 1000 networks each grown to 10 7 nodes. The maximum degree obtained was 17609. Various 
rows (i values) from 1 to 3500 of the cumulative link-space are illustrated and the effects of the finite nature of the simulation 
are apparent. 



model: The last two terms reflect the expected number of new 



(m(*)>e 4 _i(t)Li_ij 



i <-> j links being formed between the new node and ex- 



(Lij(t + 1)) = Li t j(t) H — — isting nodes. Recalling that the normalized link-space 

2-1 matrix is given by h,j(t) = Lij(t)/M(t) where the num- 

{m(t))&j^i(i)Lij-i _ (m(t))Qi(t)Ljj Der Q f un k s , M(t), at time t can be approximated as 

Xj-i Xi M(t) » at 2 /2 and making use of the previously derived, 

(m(t))Oj(t)Li j time-dependent degree distribution such that P{m(t) = 



+jQ i _ 1 P{m(t)=j}. (19) 
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j} = cj(t), Eq. (fT9]) can be rewritten as 



d / at 2 lij(t) \ (at) 



At 



■(2i-ij(t)+i«-i(*)-2l i j(t)) 



2ijCi(t)cj(t) 



at 



(20) 



The solution to Eq. (|2"U|) is found by means of ansatz (the 
choice of which is explained in Section IIII[) and is 



M*) = 



2e- 2at (af) I+ J- 2 
(i-l)l(j-l)! ' 



(21) 



This is the normalized link-space matrix for a classical 
random graph with a mean degree of erf in the large size 
limit and this is illustrated in Fig [3] where a comparison 
to a simulation is also made. 



III. DEGREE CORRELATIONS AND 
ASSORTATIVITY 

The average nearest-neighbor degree (k nn )i of nodes 
of degree i can be easily obtained from the link-space 
matrix and is given by 



2-ij=l J^i,3 _ 2-ij = \J l i,3 
j=l 2-ij=l L i,3 



(22) 



This is illustrated in Fig. [4] for the steady-state solutions 
of the random attachment model, the BA model and the 
ER random graph. If the average nearest-neighbour de- 
gree (k nn )i is constant with respect to degree i, non- 
assortativity is implied. Certainly, the random attach- 
ment curve continues to increase, implying positive as- 
sortative mixing. The preferential attachment curve in 
Fig. 2] appears to asymptote to a constant value (as was 
noted in [26|]) although the fact that it decreases ini- 
tially illustrates that it is not perfectly non-assortative. 
This is a feature that would not be captured using a 
normalised Pearson's correlation coefficient which would 
suggest non-assortativity for this network 0, [38| . 

The link-space formalism allows us to address two- 
vertex correlations in a more powerful way in that we can 
calculate the conditional, two-point vertex degree distri- 
bution, P(j\i) which has been traditionally difficult to 
measure (l9j . We can write this joint probability, i.e. the 
probability that a randomly chosen edge is connected 
to a node of degree j given that the other end is con- 
nected to a node of degree i, in terms of the link-space 
matrix (normalized or not) as P(J\i) = Li^/(iXi) = 
/ J2JLi = k,j/J2%ih,j- For total number of 
edges aproximately equal to the total number of nodes, 
this can be approximated as h.j/(ici). This is illustrated 
in Fig. [5] for the scenarios of random and preferential at- 
tachment where the non-trivial nature of the correlations 
present within these networks is evident. 

Consider selecting an edge at random in the network. 
If we then select one end of this edge at random, the prob- 
ability that this node has degree j will be proportional 



to j since higher degree nodes have, by definition, more 
links connected to them than low degree nodes. Consider 
now only a subset of all edges with one end attached to 
a node of degree i. If there were no correlations present, 
the probability that the other end is attached to a node 
of degree j is again proportional to j [3^|. The crite- 
rion of perfect non-assortativity for a network with equal 
numbers of nodes and links can be described as 



-, W. 



(23) 



This criterion is illustrated in Fig. [5] for the random at- 
tachment and BA models and confirms that neither is 
perfectly non-assortative. 



A. Perfect non-assortativity 

It is interesting to ask whether a perfectly non- 
assortative network can be generated. Assuming that 
such a network will not have equal numbers of nodes and 
links, we must rewrite the conditions of non-assortativity 
accordingly for total number of edges M: 



P{jli) = P{j) i± -1^ 



(24) 



Recalling our definition of the normalized link-space ma- 
trix that li j = Li j/M, we can express this conditional 
probability P(j\i) in terms of this matrix as 



pfj\i) = hit = Mhd. 

Wl ; iXi iNa 
As such, we can now write for L , 



1,3 



M 



(25) 



(26) 



That is, for any degree distribution, a normalized link- 
space matrix can be found which is representative of a 
perfectly non-assortative network (48[. This might pro- 
vide an alternative to the network randomization tech- 
nique of Maslov and Sneppen to provide a "null model 
network" [33]. The expression of Eq. (|26jl provides the 
ansatz solution for the normalized link-space matrix to 
the master equations for the ER random graph in Sec- 
tion nsi 



IV. DECAYING NETWORKS 

Although somewhat counter-intuitive, it is possible to 
find steady states of networks whereby nodes and/or links 
are removed from the system. Aside from the obvious sit- 
uation of having no nodes or edges left, we would like to 
investigate the possible existence of a network configu- 
ration whose link-space matrix and, subsequently, node 
degree distribution are static with respect to the decay 
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FIG. 3: (Color Online) Left: the analytically-derived, normalized link-space matrix for the Erdos and Renyi (ER) random 
graph with mean degree equal to 5. This could be filled to arbitrary size but is here truncated to maximum degree of 30. Right: 
comparison of the cumulative link-space matrices for the analytic solution and a simulation of the algorithm. The simulation 
comprises an ensemble average of 500 networks each consisting of 10 7 nodes. The probability that a link exists between any pair 
of nodes is a — 5 x 10~ 7 and the maximum degree obtained was 25. The first 20 rows (i values) of the cumulative link-space 
are illustrated and, again, the finite nature of the simulation is apparent for high j. 




degree i 



FIG. 4: (Color Online) The mean degree of the nearest neigh- 
bours {k nn )i of nodes of degree i as a function of i for the 
analytic solutions to the random attachment algorithm (Sec- 
tion |IT3, the BA model (Section |HB"1) and the ER random 
graph (Section III C|) . 



process. The concept that decay processes are highly in- 
fluential on a network's structure has been considered be- 
fore although this has typically only been investigated in 
conjunction with simultaneous growth [13, SH E2] ■ Here 
we shall employ the link-space formalism to examine the 
effect of some simple, decay-only scenarios, specifically 
the two simplest cases - random link removal and ran- 
dom node removal. 



A. Random Link Removal (RLR) 



Consider an arbitrary network. At each timestep, we 
select a fixed number, w, of links at random and remove 
them. We shall implement the link-space formalism to in- 
vestigate whether or not it is possible that such a mecha- 
nism can lead to stationary structure. Consider the link- 
space element Lij (t) denoting the number of links from 
nodes of degree i to nodes of degree j. Clearly this can 
be decreased if an i <-> j link is removed, i.e. a link that 
connects a degree i node to a degree j node. Also, if a 
k <-> i link is removed and that i node has further links 
to j degree nodes, then those that were i «-» j links will 
now become (i — 1) <-> j, similarly for k <-> j links being 
removed. However, if the link removed is a k «-> (J + 1) 
link and that j + 1 degree node is connected to a degree i 
node, then when the j + 1 node becomes a degree j node, 
the (j + 1 ) <-> i link will become an j <-> i link, increasing 
Lij. Let us assume that we are removing links at ran- 
dom from the network comprising N(t) vertices and M(i) 
links. A non-random link selection process could be in- 
corporated into the master equations using a probability 
kernel. The master equation for this process can be writ- 
ten in terms of the expected increasing and decreasing 
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degree j degree j 

FIG. 5: (Color Online) The conditional probability P(j | i) that one end of a randomly selected link has degree j given that 
the other end is of degree i over the range i — {1, 6, 11, 16, 51}. The left plot is for the random attachment algorithm and 
the right is preferential attachment. The criterion of non-assortativity of Eq. ([23} is denoted with red circles. 



contributions: 



(L id (t + 1)) = L itj (t)+wJ2 



Lk,j+i(t)jL tJ+1 (t) 
M(t)(j + l)X j+1 (t) 



k 

'£ 



^ M(t)(i + l)X i+1 {t) 

Lk,j(t)U -!)-M*) 
M(t)(j)Xj(t) 

L Ki (t)(i-l)L itj (t) 



M(t)(i)Xi(t) 



vLj,j(t) 
M(t) 



(27) 



This simplifies to 



(£ij(i + l)> = L itj (t) + 



wjL %0+1 (t) 



3W ' M(t) 
wiL i+ ij(t) w(i - l)L hj (t) 



M{t) 



M(t) 



w(j - l)Li t j(t) wLij(t) 



(28) 



M(t) M(t) 

where the last term refers to the physical removal of an 
i <-> j link. 

To investigate the possibility of a steady state solution, 
we make a similar argument as before for growing net- 
works but this time for the process of removing w links 
per timestep: 

L hJ (t) = li,jM(t) 

= k,j(M -wt), 



At 



-wl hj 
L itj (t + l)-L i>:j {t). 



(29) 



The expression of Eq. 



can be reduced to 



ih+l,j + jh.j + l 



^,3 



(30) 



As such, (twice) the fraction of 1 <-> 1 links, does 
not reach a steady state. This might be expected, as 
the removal process for such links requires them to be 
physically removed as opposed to the process by which 
the degree of the node at one end of the link is being 
decreased. Therefore, the value ly t \ increases in time. 
However, we can investigate the properties of the rest of 
the links in the network which do reach a steady state by 
neglecting these two node components (see Appendix [C] 
for details). The normalized link-space matrix can be 
written for this system for i + j ' =/= 2 as 



= A (i + j-3)! 
^ 2*+i (i -!)!(>■ -l)f 



(31) 



The summation over this link-space matrix does not con- 
verge, i.e. — ► oo. However, we can still derive 
the degree distribution using Eq. ([5]) and normalize this 
such that J^i c i = 1 ( see Appendix [C] for details): 



Cl 



log(2) 
l + log(2) : 



(l + log(2))i(i-l) 



i > 1. 



(32) 



Consequently, the first moment of the distribution di- 
verges also, yj i = (fc) ^ oo. It is interesting that 
the process of randomly removing links generates a net- 
work which exhibits power-law scaling of exponent 2, 
corresponding to a degree distribution significantly over- 
skewed with respect to the preferential selection process 
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of the BA growth algorithm (exponent 3). By considering 
the average degree of the neighbours of nodes of degree 
i, as discussed in Section we can see that the ran- 
dom link removal algorithm generates highly assortative 
networks. 



B. Random Node Removal (RNR) 



In a similar manner to Section llV A[ we will now dis- 
cuss the possibility of creating such a steady-state struc- 
ture via a process of removing nodes (with all of their 
links) from an existing network at the rate of w nodes 
per timestep. Clearly, removing some node which has a 
link to an (i + l)-degree node which in turn has a link to 
a j-degree node can increase the number of i <-> j links 
in the system. The other processes which can increase 
or decrease the number of links from i degree nodes to 
j degree nodes can be similarly explained. We consider 
selecting a node of some degree k for removal with some 
probability kernel ®k (as in the growing algorithms of 
Section [TTJ) . The master equation for such a process can 
thus be written for general node selection kernel: 



(L itj (t + 1)) = L id (t) - 



Xi(t) 



{i + l)X i+1 (t) 



+ L 



k,j+i 



it) 



jL hj+1 (t) 
(j + l)X j+1 (t) 



x k (t) 



iXi(t) 



U-l)L itj (t) 



(33) 



We now specify a kernel for selecting the node to be re- 
moved, namely, the random kernel although the approach 
is general to any node selection procedure. Selecting a 
node purely at random leads to Ofc = c^. The steady- 
state assumptions must be clarified slightly. As before 



L hJ (t) = kjM(t). 



(34) 



However, because we can remove more than one link 
(through removing a high degree node for example) we 
must approximate for M(t). Using the random removal 
kernel, we can assume that on average, the selected node 
will have degree equal to the mean degree of the network, 
(k(t)) = 2 xlt) ■ We can use this to write the number of 



remaining links in the network as 

Lij(t) = lu(M(0) - w(k(t))) 



k,j(M 



2wM(t) 



dL 



dt 



2wM(t) 
L itj (t + i)-L itj (t). 



(35) 



Making use of the link-space identities and Eq. (|35D . 
the master equation Eq. (|33[) can be written in simple 
form for the steady state as 



ik+i,j + jk,j+i 



(36) 



Clearly, this is identical to Eq. (|30[) for the random link 
removal model of Section IIV Al and, consequently, the 
analysis of the degree distribution will be the same too. It 
is interesting that the random node removal and random 
link removal lead to the same degree correlations. 



V. MIXTURE MODEL 

In this section we introduce a simple model that makes 
use of only local information about node degrees as mi- 
croscopic mechanisms requiring global information are 
often unrealistic for many real- world networks [43| . It 
therefore provides insight into possible alternative mi- 
croscopic mechanisms for a range of biological and social 
networks. The link-space formalism allows us to iden- 
tify the transition point at which lower power-law ex- 
ponent degree distributions switch to higher exponent 
distributions with respect to the BA preferential attach- 
ment model. While similar local algorithms have been 
proposed in the literature [ll|, IHL EH], the strength of 
the approach followed here is the ability to describe the 
inherent degree-degree correlations. 

It is well known that a mixture of random and preferen- 
tial attachment in a growth algorithm can produce power 
law-degree distributions with exponents 7 6 [3, 00) when 
new links are only established between the new node and 
the existing network [49| . It has often been assumed that 
a one step random walk replicates linear preferential at- 
tachment [43L [45l] . This is not true. A one step random 
walk is in fact more biased towards high degree nodes 
than preferential attachment [5C| as can be easily seen 
by performing the procedure on a simple hub and spoke 
network. In this case, the probability of arriving at the 
hub tends to one for increasingly large networks rather 
than a half as would be appropriate for preferential at- 
tachment. We can use this bias to generate networks with 
degree distributions that have lower power-law exponent 
than the preferential attachment model,a phenomenon 
also observed by Evans and Saramaki [4J]. A mixture 
of this approach with random attachment results in a 
simple model that can span a wide range of degree dis- 
tributions. This one-parameter, growing network growth 
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model, in which we simply attach a single node at each 
timestep with a single undirected link, does not require 
prior knowledge of the existing global network structure. 

The algorithm proceeds explicitly as follows: (i) pick a 
node k within the existing network at random; (ii) with 
probability a make a link to that node; otherwise (iii) pick 
any of the neighbours of k at random and link to that 
node. Hence this algorithm resembles an object or 'agent' 
making a short random-walk. This is very similar to the 
Growing Network with Redirection model introduced by 
Krapivsky and Redner [ll| except that the model here 
employs undirected links which necessitates a random (as 
opposed to deterministic) walk. Fig. [6] shows examples 
of the resulting networks and the corresponding (cumu- 
lative) degree distributions. Interestingly, a — yields 
a network that is dominated by hubs and spokes while 
a = 1 yields the random attachment network. Interme- 
diate values of a yield networks which are neither too 
ordered nor too disordered. For a ~ 0.2, the algorithm 
generates networks whose degree-distribution resembles 
the BA preferential-attachment network (see Figs. [6] and 
ED- 

Our analysis of the algorithm starts by establishing the 
attachment kernel, (t) , which in turn requires properly 
resolving the one-step random walk. The link-space for- 
malism provides us with an expression for the probability 
P/(i) associated with performing a random walk of length 
one and arriving at a degree i node [38| : 

Xi(t)Li^(t) X 2 (t)L 2 ,j(t) X 3 (t)L 3 4t) 
l[ ' N(t)Xi(t) 2N(t)X 2 (t) + 3N(t)X 3 (t) + 

(37) 

This can also be written (38| as P,'(t) = { k 1 (t) )» 

where the average is performed over the neighbours of 
nodes with degree i. Note that this quantity does not 
replicate preferential attachment, in contrast to what is 
commonly thought [4^,|4^|. Defining /3,(i) as 

1 -I U,kW hAt) 

i Ci(t) k nn (t) 1 X) fc £i,fe(*) Z)*Afe(*) 

(38) 

yields 

0((t) = ac t {t) + (1 - a)/3i(t)ici(t) . (39) 

To investigate a possible steady-state, we could substi- 
tute the above equations into Eqs. ([3]) and ([4]). However 
the non-linear terms resulting from [3 imply that a com- 
plete analytical solution for ij a and Cj is difficult. We 
leave this as a future challenge, but stress that our for- 
malism can be implemented in its non-stationary form 
Eq. |(5J| numerically by iteration with very good efficiency 
[38| . yielding the degree distributions shown in Fig. [7l 

We can now use the link-space formalism to deduce the 
parameter value at which our algorithm yields the BA 
degree distributionin the long-time limit. At this value 
a = a c , the node-degree distribution goes from lower 
to higher power-law exponents with respect to the BA 



model. For this parameter value, the attachment prob- 
ability to nodes of various degrees is equal for both our 
mixture algorithm and the BA model. Using Eqs. (fT2"|) 
and (f3"9")> . we have | = a c + (1 — a c )(3ii, and hence for 
large i this yields a c = 1 — ^g-. We could then proceed 
to use the exact solution of the link-space equations for 
the preferential-attachment algorithm, in order to infer 
Pi in the high i limit. However, since (3i can be expanded 
in terms of hj as shown in Eq. (|38[) and decays very 
rapidly as i, j become large, we can obtain a good approx- 
imation by using only the first two terms of Eq. (|15p . This 
yields (3 w 0.66. Hence, the critical value at which this 
simple model approximates the BA degree distribution is 
a c = 0.25, illustrated with the inset plot of Fig. [7] 



VI. CONCLUSION 



We have developed a new formalism which accounts 
for degree-degree correlations in networks. We have em- 
ployed the formalism to produce analytic solutions to 
the link-space matrix for the random attachment, BA 
and ER network models allowing detailed analysis of the 
degree correlations therein. We have introduced a cu- 
mulative implementation of the formalism which can be 
used to compare link-space matrices and might also be 
applied to empirical networks. We have shown that a per- 
fectly non-assortative network can be generated with ar- 
bitrary degree distribution. We have also demonstrated 
the possiblility of a steady-state of the degree distribu- 
tion and of the two-point degree correlations for simple 
decaying networks through deriving the static, normal- 
ized link-space matrix for such a process. Employing the 
link-space formalism allowed us to accurately describe a 
simple one-parameter network growth algorithm which is 
able to reproduce a wide variety of degree distributions 
without any global information about node degrees. We 
have used the framework to show that a one-step random 
walk does not replicate preferential attachment except 
in the particular case of a perfectly non-assortative net- 
work. Indeed this in itself represents a criterion for per- 
fect non-assortativity. Whilst the present paper focuses 
on introducing and illustrating the use of the link-space 
formalism for a variety of model networks, we stress that 
its applicability is far more general. 
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FIG. 6: (Color Online) Top: Networks generated using the one-parameter, local information growth algorithm with a = 1 
(left), a = 0.2 (center), a = (right). Initial network seed comprised two nodes and one link. Networks drawn with Pajek [46j]. 
Bottom: Cumulative degree distributions for the same networks grown to 10 5 nodes and ensemble-averaged over 100 networks 
per a value. 




APPENDIX A: 



degree i 



FIG. 7: (Color Online) Degree distributions generated us- 
ing the link-space analysis of our one-step algorithm. Dashed 
line: results for the Barabasi-Albert (BA) [|| preferential- 
attachment algorithm. Our one-step algorithm is closely re- 
sembles the BA results with parameter value a = 0.25 (see 
inset). 



EXACT SOLUTION OF RANDOM 
ATTACHMENT 



For the random attachment algorithm, where one new 
node is added to the existing network with one new link 
at each timestep, the link-space master equation can be 
expressed for the steady state as 



h 



3 
1.1 



J! 

Cj_i + hj-i 



0. 



i,3 > !, 
3 > h 



(Al) 



It is easy to populate the link-space matrix numerically, 
just from the degree distribution Cj = 2~ l obtained from 
solving the node-space recurrence relation and from the 
link-space master equation Eq. (|A1[) . At first glance, the 
solution to the master equation I Al I would be of the form 
(check by substitution) 



u 



4 2 i+ J 3 i+ J 



(A2) 



However, the boundary conditions, which could be inter- 
preted as influx of probability into the diffusive matrix, 
are such that this does not hold. We can actually solve 
the normalized link-space matrix for this model exactly. 
Consider the values Ci as being influxes of probability 
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FIG. 8: The paths of probability flux from c x _i influencing 

element hj. Each arrow (step) represents a further factor of 
l 

3 • 



into the top and left of the link-space matrix. We can 
compute the effect of one such element on the value in 
the matrix l^j. Each step in the path of probability flux 
reflects an extra factor of |. First, we will consider the 
influx effect from the top of the matrix as in Fig. [5J The 
total path length from influx c x -i to element lij is sim- 
ply i+j — x. The number of possible paths between c x _i 
and element ijj can be expressed as (•'"J-x^ ). We can 
similarly write down the paths and lengths for influxes 
into the left hand side of the matrix. The first row (and 
column) can be described as 



h,j - 



c i~i + h,j-i 
3 

3 



fe=i 



(A3) 



Subsequent rows can be similarly described. So, for i, j > 
1 an element can be written: 



(i-l+j-x\ i /i-l+j-x 

\ j—x I V i—x 

^(i+]-x)2(x-l) 2-^i 3(i+j~x)2( 

x—2 x—2 



E 



7, ' ^ .(A4) 



The results can be observed in Fig. [5] where a comparison 
to the numerically populated link-space matrix is made. 



FIG. 9: (Color Online) Comparison of the numerically de- 
rived link-space matrix (markers) and the analytic solution 
(solid lines) for random node attachment for the first 20 rows 
(i values) of the link-space matrix. The numerical imple- 
mentation populates the link-spcae matrix directly from the 
degree distribution and the steady-state link-space recurrence 
relations. 



APPENDIX B: EXACT SOLUTION OF 
PREFERENTIAL ATTACHMENT 



In the BA model, when adding one new link with one 
undirected node per timestep, the steady-state solution 
of the node-space master equation leads to the degree 
distribution 



c, 



i(i + !)(* + 2)' 



(Bl) 



which can be checked easily by substitution. Substitut- 
ing the attachment probability kernel 0j sa iCi/2 into the 
link-space master equation Eq. ([4]), we obtain the mas- 
ter equations for the link-space for this network growth 
algorithm: 



4 j ~ n , „• , ■ ! *! 3 > t; 



h,j - — 
h,i = 0. 



2 + i + j 



3 + j 



J>1, 



(B2) 



Again, it is easy to populate the matrix numerically just 
by implementing the node and link-space equations. At 
first glance, the solution to the master equation Eq. (|B2|) 
would be of the form (check by substitution) 



.4 



i(i + l)j(j + iy 



(B3) 
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FIG. 10: The components of flux of probability around the 
link-space matrix. This is general to any attachment kernel. 
An element within the link-space matrix can be built up from 
contributing elements and the appropriate factors. 



where A is a constant. This would imply for the degree 
distribution 



A 



i 2 (i + l)' 



(B4) 



Summing over the entire node-space would give A — 
■ However, the boundary conditions (which could 
be interpreted as influx of probability into the diffusive 
matrix) are such that this solution does not hold. This is 
evident when comparing Eq. (|B4[) with Eq. (|B1[) . which 
compares to simulated networks well. 

We can obtain an exact (although somewhat less 
pretty) solution by tracing fluxes of probability around 
the matrix and making use of the previously derived de- 
gree distribution. We can rewrite our link-space master 
equation, (|B2|) . in terms of vertical and horizontal com- 
ponents: 



h,i = 0, 



(B5) 



where 



Ci-l 



If H 

Cj-l 



} ' 



- i,3 



1 + % + 2f 



(B6) 



By considering the probability fluxes as shown in 
Fig. 1101 we can write the individual elements in the link- 



space matrix as 



'1.;/ 



u 



y=2 
j 



y=2 
hi = 0. 



( k-l,y^i,y ^ ) ^ IT ^ l ' x ' 



x=y+l 



x=2 



(B7) 



Note that we have yet to introduce the attachment 
probability kernels and the analysis so far is general. Us- 
ing the preferential attachment kernel, 0^ « we can 
write our component-wise factors for the master equation 
as 



i - 1 
i + j + 2' 

T = J ' ~ 1 



(B8) 



Substituting Eq. (|B8[) into Eq. (|B7|) and using the pre- 
viously derived degree distribution of Eq. (jBip yields for 
the first row 



h. 



E 



— r- 1 

l)(v + l) 11 



4(j - 1)! 
(i + 3)! 



E(y + 2) 



y=2 

2(j + 6)(j-l) 

" j(i + i)(i + 2)(j + 3)' 

Subsequent rows can be written as 
(7-l)!(3 + i)! 



(B9) 



'i.7 — ■ ^i.l " 



(2 + i+ j)! 



y=2 

Cr — i)i 



(j-1)! (1 + t + i/)! 



(2 + i + j)! 



(2 + i + j)! 
(3 + i)!ii,i, 



(*-l) £>-!,,, 



y=2 



(2/ " I)' 



(BIO) 



Rewriting this gives 



kj ( 2 ( +i+j)!^ A "' + i: " 



(Bll) 



where the meaning of Ki and follows from 

Eq. (|B10p . Clearly, we can write Eq. (|B1 1[) for k_i )J( 
as 



(y-i)! 

(1 + * + »)! 



i— 1,1/ 



(B12) 
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Substituting Eq. (|Bl"2f into Eq. (|B10|) yields 



k < j = + c- 1 ) £(*<-* + 

y — 2 

(B13) 

In order to solve this recurrence relation, we define an 
operator for repeated summation, S" y , such that 



sjMv)) = £/(y), 

y=2 

j Vn Vn-1 V4 2/3 J/2 

sum) = £ £ £ -EEE/M- 

y„=2 y„_i = 2 y„_ 2 =2 y 3 =2 y 2 =2 y=2 

(B14) 

The first subscript denotes the initial variable to be 
summed over and the second the final limit. A few ex- 
amples of this operation will clarify its use: 

S°, y (f(y)) = /(»), 

= Si, y {y) - Si, y {l) 



^L(i) = s(i 3 -i). 



^■,3/(y) 



1 

6 ( 

i(i + 1) 



i 



= «(r + 3r-4?). 



(B15) 



We can use this operator in our expression for the element 
U.j in Eq. (|B13[) and expand to the easily derived value 



Ki + (l- l)Sj x y(Ki-l + Ei-t x y) 

Ki + (i-l)Ki-iS jlV (l) 







I - {J 


-iV- i 


l - J ~ (2 + 




u 




(2 + 




Hi- 


-l)(i- 


U 


-1)! I 


(2 + 




Hi- 


-l)(i- 


Hi- 


-l)(t- 


Hi' 


-l)(i- 


Hi' 


-l)(i- 



X 4 + (i-l)AVi% y (l) 



j,v v 



(B16) 



We can express A2 !a in terms of the operator S too: 

E 2 , y = iSy,y(2Sy,y(l) + Sy^)). (B17) 



The element i^j can be expressed as 
(7-1)1 



- ■ , ,„ i=2 (m-l)! Am ^ W 
+4(z-l)!((2^(l) + ^( 2 /)) 



This form is somewhat obtuse as the calculation of the 
operator values is less than obvious. However, we can 
transform to a more easily interpreted operator W(n) 
analogous to S but with different limits such that 

WiMy)) = E/(y), 

j Vn Vn-1 1/4 V3 V2 

w?jM) = E E E ■ ••£££/(*)■ 

S/n = l 2/„_l=l K„-2=l 2/3 = 1 j/2 = l y=l 

(B18) 

Whilst, at first glance, it looks like little progress has been 
made, we only need evaluate the repeated operation on 
initial function f(y) — 1. This is exactly solvable and 
we can rewrite in terms of a function G(n), dropping the 
superfluous y subscripts: 

Gj(n) = 

C (j+n-l)< f Q> n 

GM = «!0-D! ;° rn - n U (B19) 
for n < 0. 



The inductive proof associated with Eq. (|B19j) can be 
understood by path counting for some repeating binomial 
process and is an intrinsic property of the combinatorial 
choose coefficient. Consider a repeated coin toss over x 
steps. The number of ways of achieving n + 1 successes 
after these x iterations is ( n T -J ■ Now, the occurrence of 
this last success could have happened on the (n + l) th 
iteration or the following one, or any of the subsequent 
iterations till the x th one. For the last successful outcome 
to occur on the m th step, n successful outcomes must 
have occurred in the previous steps. The number of ways 
this could have occurred is ( m ~ 1 ) ■ Clearly, summing over 
all possible m values, the total possible paths resulting 
in n + 1 successes must equate to \ n "Li) ■ 

For clarity, this is depicted in Fig. [TTJ To reach point 
B from A in the binomial process, one of the steps w, x, y 
or z must be traversed, after which there is only one route 
to B. Consequently, the number of paths between A and 
B utilising step w is the same as the number of paths be- 
tween A and W. Similarly, the number of paths between 
A and B utilising step x is the same as the number of 
paths between A and X and so on. The number of paths 
between A and B can be built expressed as the sum of 
the paths A -> W, A -> X, A -> Y and A -> Z. 

We incorporate this behaviour into our proof for the 
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FIG. 11: A binomial process over seven steps. The number 
of paths between A and B can be expressed as the sum of the 
paths A -> W, A -> X, A -» Y and A -> Z. 



FIG. 12: (Color Online) Comparison of the numerically de- 
rived link-space matrix (markers) and the analytic solution 
(solid lines) for preferential attachment. The numerical im- 
plementation populates the matrix from the solution of the 
degree distribution and the link-space recurrence equations. 



solution of Gj(n). 



j + n-1 



GM = 

G 3 {n+l) = ^Gj(n) 

'j + n — 1 



E 



j + n 

71 + 1 



(B20) 



As G(l) = (j) = j, this inductive proof holds for all n 
and the following relations hold: 

S n (l) = G(n)-G{n-l), 

S n (j) = G(n + 1)-G(n-1). (B21) 

We can now write the element in our link-space matrix 
for preferential attachment exactly as our function G(n) 
is easily evaluated as 



= 4(j 1)1 / + _ _ 

(.?+* + 2)! ^ 

+ \ E( fc - !)( fc + 6 ) ( G (* - *) - G(i - fc - 1)) ] . 

fc=2 / 

(B22) 



A comparison of this solution is made to a numerically- 
populated link-space matrix in Fig. 1121 



APPENDIX C: DEGREE DISTRIBUTION OF 
DECAYING NETWORK 

The link-space master equation for the steady state of 
the decaying networks of Section IIVI is written as 



h. 



1,3 



(CI) 



The number of 1 <-> 1 links does not reach a steady state 
for this process. However, we can investigate the proper- 
ties of the rest of the links in the network which do reach 
a steady state by neglecting these two-node components. 
In a similar manner to [ll| we can make use of a sub- 
stitution to find a solution to this recurrence equation, 
namely, for i + j ^ 2 as 



lJ _ r «(i-l)!C?-l)f 



From this, we can obtain 



— r i+l,j 



+ r, 



This has the simple solution: 



2 l +J 



(C2) 
(C3) 
(C4) 



The link-space for this system can be written for i+j ^= 2 
as 



A (i + j -3)! 
2 i+j (i-l)!C?-l)!" 



(C5) 



Although the summation over this link-space matrix di- 
verges, J2i J2j h,j °°i we can use the form of Eq. (|C5|) 
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to infer the shape of the degree distribution. Recalling 
that 



N i ' 



we can write for the degree distribution, neglecting the 
two node components in the network, as 



ci 



MAy, 0-2)! 



N f^ 2 (k- l)!2 1 + fc ' 



AT4 2-^, 



Ni ^ (i- l)!(fc - l)!2 4 + fc ' 



i > 1. (C7) 



The c\ value can be solved by considering the Maclaurin 
Series of the function log(l + x) (as a historical note, this 
was done by Mercator as early as 1668) and evaluating 



for x = — i: 



log(l + x) = J2 



k' = l 

iog G) = -e^V 

= -log(2). 



(C8) 



Rearranging Eq. (|C7|1 and letting k = k' + 1 yields 



ci 



A/A 

MA 

MA 
4iV 



E 



(fc - l)2!+ fe 



oc ^ 

E 



fe'=i 



k'2 k '+ 2 



log(2). 



(C9) 



For node degree greater than one, the solution of (|C7[) re- 
quires a simple proof by induction. Consider the function 
Q(i') defined for positive integer i' as 



oo /i'+k'\ 

o(0 = EW 



(CIO) 



fc'=0 



We can write + 1) with similar ease as 

oo n'+k' + l\ 

0(*' + i) = EUrr- 



fc'=0 



(Cll) 



This leads to 



(C6) 2Q(i'+l)-Q(i') = J2 



k'=0 



i' + k'+l\ fi' + k'\\ 1 



^ U'-l J2 k '+' 1 '' 

k' = l v 7 

A quick substitution of fc" = fe' — 1 leads to: 



2Q(i' + 1) - Q(i') 

Q(i' + 1) 
0(1) 



- E 

fc"=0 

- Q(i' + 1) 

= 0(0, 

= 2 

= 0(0, 



i' + k" + 1 



fc' y y 2 fe, + 1 ' 

(C12) 

i 



2k"+i'+l 



(C13) 



which holds of for all i' . Rearranging Eq. (|C7[) and using 
some simple substitutions, i' = i — 2 and fc' = fc — 1, we 
can derive the following: 



MAA (t + fc- 3)! 
IvT^ (i-l)!(/fc-l)!2 l + fe 

MA ^ (i + fc-3)! 
iVz(i - 1) £^ (i - 2)!(fc - l)!2 i + fc 



MA 



fc=i 



fc' 

2*'+As' 



+3 



fe'=0 



MA 1 

4iVj(j - 1)' 



(C14) 



We can normalize this degree distribution such that 
Ci = 1 and so for the decaying network without the 
two node components (the 1 <-> 1 links), the degree dis- 
tribution can be expressed as 



AN 



Cl 



M(l + log(2))' 

log(2) 
1 + log(2) ' 
1 

(l + log(2))i(i-l)' 



i > 1. 



(C15) 
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